Detection of Characteristic Co-Occurrence Words from News Articles on the Web
نویسندگان
چکیده
A large number of news articles are published on the Web every day, and demand of discovering news articles on new/important topics has been growing. In this paper, we present a method for detecting characteristic words co-occurring with a target word (characteristic co-occurrence words) to help users find important topics related to the target word. The method divides news articles published in a certain period of time into two groups by whether the target word is included or not, then computes score of each word co-occurring with the target word in some news articles by counting the number of news articles including the co-occurring word for each of the news article groups. We can detect characteristic co-occurrence words more effectively by clustering news articles in advance and computing the score only in clusters which news articles including the target word belong to.
منابع مشابه
News-Topic Oriented Hashtag Recommendation in Twitter Based on Characteristic Co-occurrence Word Detection
Hashtags, which started to be widely used since 2007, are always utilized to mark keywords in tweets to categorize messages and form conversation for topics in Twitter. However, it is hard for users to use hashtags for sharing their opinions/interests/comments for their interesting topics. In this paper, we present a new approach for recommending news-topic oriented hashtags to help Twitter use...
متن کاملThe analysis of co-citation and word co-occurrence networks of Iranian articles in the field of dentistry
Background and Aims: Dentistry is an important profession ensuring the health of body and soul, and has a special place in the scientific productions of medical disciplines. The purpose of this study was to analyze the co-citation and word co-occurrence of Iranian research papers in the field of dentistry based on indexed documents in Web of Science from 2014 to 2018. Materials and Methods:...
متن کاملA New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملVisualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database
Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...
متن کاملVisualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database
Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011